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8. □ 
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This IS a FIRS T submission of items concerning a filing under 35 U.S.C. 371. 

This is a SECOND or SUBSEQUENT submission of items concerning a Cilmg under 35 U.S.C. 371. 

'1 his IS an express request to begin national examination procedures (35 U.S.C. 37 1(0). 

'1 he US has been elected by the expiration of 19 nionlhs from tlie priority date (I'CT Article 3 1 ). 

A copy of thclnternational Application as filed (35 U.S.C. 371(c)(2)) ~ 

a □ is attached hereto (required only if not communicated by the International Bureau). 

b. H has been communicated by the International Bureau. 

c. n IS not required, as the application was filed in the United Stales Receiving OITicc (RO/US). . . 

An English language translation of the International Application as filed (35 U.S.C'. 371(c)(2)). 

Amendments to the claims of the International application under PCT Article 1 9 (35 U.S.C. 371(c)(3)) 

a. [H are attached hereto (required only it not communicated by the International Bureau), 
have been communicated by the International Bureau. ; v- ; - 

c. □ have not been made; however, the time limit for making such amendments has NOT expired. . 

d. □ have not been made and will not be made. 

An luiglish language translation of the amendments to (he claims under PCI" Article 19.(35 U.S.C. 372(c)(3)). 
An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)). 

An English language translation of the annexes to the International Prclmiinary Examination Report under 
PCT Ar(iclc-36 (35 U.S.C. 371(c)(5)). 
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11. □ An Information Disclosure Statement under 37 C.l-.R. 1.97 and 1.98. 
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i The following fees arc submitted: 



Basic NaJional Fee (37 CFR 1.492(a)(l)-(5)): 
•Neither international preliminary examination fee (37 CFR 1.482) 
nor international search fee (37 CFR 1.445(a)(2)) paid to USPTO 

and hiternational Search Report not prepared by the EPO or JPO S1040.00 

hitcrnational preliminary examination fee (37 CFR 1 .482) not paid to 

USPTO but International Search Report prepared by the EPO or JPO S890.00 

International preliminary examination fee (37 CFR 1 .482) not paid to USP l O but 

international search fee (37 CFR 1 .445(a)(2)) paid to USPTO S740.00 

International preliminary examination fee paid to USPTO (37 CFR 1 .482) 

but all claims did not satisfy provisions of PCT Article 33(l)-(4) $690.00 

International preliminary examination fee paid to USPTO (37 CFR 1 .482) 
and all claims satisfied provisions of PCT Article 33(2)-(4) S 100.00 



ENTER APPROPRIATE BASIC FEE AMOUNT 



Surcharge of $130,00 for furnishing the oath or declaration later than 0 20 0 30 months 
from the earliest claimed priority date (37 CFR 1 .492(e)). 



Claims. 


Number Filed 


Nufiiber Extra 


Rale 




Tola! Claims 


26-20 = 


6 


X $18.00 


$ 54.00 




Independent Claims 


3- 3 = 


0 


X $84.00 


% 




Multiple dependent claim(s) (li applicable) 


+ $280.00 


% 




TOTAL OF ABOVE CALCULATIONS = 


$ 




■ Applicant claims small entity stains. See 37 CFR 1 .27. The Ices indicated above arc 
reduced by 1/2. 


.$ 




SUBIOTAL = 


$54.00 




Processing fee of $130 00 lor luriiishing the l-.nglish translation later lhan □ 20 □ 30 months from the 
earliest claimed priority date (37 CFR 1.492(0), + 


$ 




l OTAL NATIONAL FEE = 


$ 




Fee for recording the enclosed a.ssignmcnt (37 CI R 1.21(h)). 1 he a.ssigninent must be accompanied by 
an appropriate cover sheet (37 CFR 3.28, 3.3 1 ). $40.00 per property + 


$ 




TOTAL FEES ENCLOSED = 


$574.00 






refunded: 


$ - . 




charged: 


$ . 



1. L2sl A check in the amount of $574.00 to cover the above fees is enclosed. 

). n Please charge my Deposit Account No. in the amount of $ _ 

sheet is enclosed. 



;r the above fees. A duplicate copy of this 



c. CS The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any overpayment to Deposit 

Account No. 19-0036 . A duplicate copy of this sheet is enclosed. 
NOTE: Where an appropriate time limit Under 37 CFR 1.494 or 1.495 has not been met, a petitior 
must be filed and granted to restore the application to pending status. 
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Sterne, Kessler, Goldstein & Fox p.l.l.c. 

1 100 New York Avenue, NW, Suite 600 
Washington, D.C. 20005-3934. 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re application of: 
Rouleau et al. 

Appl. No. To be assigned (Nat'l Phase oflnt'l 
Appl. No. PCT/CAOO/01052) 

Filed: March 8, 2002 (Int'l Filing Date: Septembers, 
2000) 

For: Diagnosis, Prognosis and Treatment of 
Trinucleotide Repeat-Associated 
Diseases and Intranuclear Inclusions- 
Associated Diseases 



Art Unit: To be assigned 
Examiner: To be assigned 

Atty. Docket: 1619.01 10000/SRL/KYP 



Preliminary Amendment 

Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

Please enter the following amendments prior to examination of the above- 
referenced application. This Amendment is provided in the following format: 

(A) A clean version of each replacement paragraph/section/claim along 

with clear instructions for entry; 

(B) Starting on a separate page, appropriate remarks and arguments. 37 
C.F.R. § 1.111 andMPEP714; and 

(C) Starting on a separate page, a marked-up version entitled: " Version 
with markings to show changes made. " 

It is not believed that extensions of time or fees for net addition of claims are 
required beyond those that may otherwise be provided for in documents accompanying 
this paper. However, if additional extensions of time are necessary to prevent 
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abandonment of this application, then such extensions of time are hereby petitioned 

under 37 C.F.R. § 1.136(a), and any fees required therefor (including fees for net 

addition of claims) are hereby authorized to be charged to our Deposit Account No. 

19-0036. 

Amendments 

In the Claims: 

Please cancel claims 3-4, 7-10, 13, and 15 without prejudice or disclaimer. 

Please add the following claims: 

16. (new) The method of claim 1 , wherein said disease is a neurological 
disease. 

17. (new) The method of claim 2, wherein said disease is a neurological 
disease. 

18. (new) The method of claim 1 , wherein said determining is carried out 
with one of a ligand and/or a nucleic acid sequence. 

19. (new) The method of claim 2, wherein said determining is carried out 
with one of a ligand and/or a nucleic acid sequence. 
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20. (new) The method of claim 16, wherein said determining is carried out 
with one of a ligand and/or a nucleic acid sequence. 

2 1 . (new) The method of claim 1 7, wherein said determining is carried out 
with one of a ligand and/or a nucleic acid sequence. 

22. (new) The method of claim 5, wherein said polyamino acid stretch- 
containing protein is expressed by an expression vector which comprises a repeat 
domain. 

23. (new) The method of claim 6, wherein said polyamino acid stretch- 
containing protein is expressed by an expression vector which comprises a repeat 
domain. 

24. (new) The method of claim 22, wherein said polyamino acid stretch- 
containing protein is a polyalanine stretch-containing protein. 

25. (new) The method of claim 23, wherein said polyamino acid stretch- 
containing protein is a polyalanine stretch-containing protein. 

26. (new) The method of claim 24, wherein said polyalanine stretch is 
encoded by a CAG repeat. - 
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27. (new) The method of claini 25, wherein said polyalanine stretch is 
encoded by a CAG repeat. 

28. (new) The method of claim 26, wherein said CAG repeat is an 
uninterrupted CAG tract. 

29. (new) The method of claim 27, wherein said CAG repeat is an 
uninterrupted CAG tract. 

30. (new) The method of claim 24, wherein said polyalanine stretch is 
encoded by a GCG repeat. 

3 1 . (new) The method of claim 25, wherein said polyalanine stretch is 
encoded by a GCG repeat. 

32. (new) The method of claim 14, wherein said GCG repeat is present in the 
PABP2 gene. 

33. (new) The method of claim 30, wherein said GCG repeat is present in the 
PABP2 gene. 

34. (new) The method of claim 31, wherein said GCG repeat is present in the 
PABP2 gene. 
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Remarks 



Upon entry of the foregoing amendment, claims 1-2, 5-6, 1 1-12, 14, 16-34 are 
pending in the application, with 1, 5, and 12 being the independent claims. Claims 3-4, 
7-10, 13, and 15 are sought to be cancelled without prejudice to or disclaimer of the 
subject matter therein. New claims 16-34 are sought to be added. These changes are 
believed to introduce no new matter, and their entry is respectfully requested. 

If the Examiner believes, for any reason, that personal communication will 
expedite prosecution of this application, the Examiner is invited to telephone the 
undersigned at the number provided. 



Respectfully submitted. 



Sterne, Kessler, Goldstein & Fox p.l.l.c. 



Date: 




1 100 New York Avenue, N.W. 
Suite 600 

Washington, D.C. 20005-3934 
(202)371-2600 
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Version with markings to show changes made 

Claims 3-4, 7-10, 13, and 15 are cancelled. 
Claims 16-34 are added. 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re application of: 
ROULEAU et al. 

Appl. No. 10/070,664 (which is the U.S. 
Natl. Stage of PCT Appl. No. 
PCT/CAOO/01052) 

I.A. Filing Date: September 8, 2000 

For: Diagnosis, Prognosis and 

Treatment of Trinucleotide 
Repeat-Associated Diseases and 
Intranuclear Inclusions- 
Associated Diseases 



Confirmation No. 9379 

Art Unit: To be assigned 

Examiner: To be assigned 

Atty. Docket: 1619.01 10000/SRL/AGU 



Second Preliminary Amendment and Submission 
of Sequence Listing 



Assistant Commissioner for Patents 
Washington, D.C. 2023 1 

Sir: 

In response to the Notice to File Missing Parts of Application, dated June 10, 2002, 
in the above identified matter, and in advance of prosecution, please amend the application 
as follows: 



In the Specification: 

Please insert the sequence listing at the end of the application. 



Remarks 

No new matter has been added. The specification has been amended to direct the 
entry of this sequence listing after the claims of the above identified application. 

In accordance with 37 C.F.R. § 1 .821(g), this submission includes no new matter. 
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Appl. No. 10/070,664 (which 
is the U.S. Natl. Stage of PCT 
Appl. No. PCT/CAOO/0 1 052) 



In accordance with 37 C.F.R. § 1 .82 1 (f), the paper copy of the Sequence Listing and 
the computer readable copy of the Sequence Listing submitted herewith in the above 
application are the same. 



Summary 



It is respectfully believed that this application is now in condition for examination. 
Early notice to this effect is respectfully requested. 



Respectfully submitted, 

Sterne, Kessler, Goldstein & Fox p.l.l.c. 




Angelique G. Uy 
Agent for Applicants 
Registration No. 48,832 

Date: In^lO^O')-^ 

1 100 New York Avenue, N.W. 
Suite 600 

Washington, D.C. 20005-3934 
(202) 371-2600 
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SEQUENCE LISTING 

SEQIDNO:1 TTTTAAGCTTAGACAAATAAACATGGAG 

SEQ ID NO:2 CCGGTGGATCCCTCATCCTGATAGGTCCCGCTGCTG 

5 SEQ ID NO:3 CCGGTGGATCCCTCATTGATAGGTCCCGCTGCTG 

SEQ ID NO:4 TTTAAGC7TCCCACCATGGAGTCATCTTCCA 

SEQ ID NO:5 CCGGTGGATCCCTCAGGGCGTAGTCGGGGACGTCG- 

TAGGGGTACATGGATGTGAACTCTGTCCTGATAGGTCCCG 
CTG 

10 SEQIDNO:6: CCGGTGGATCCCAGGGCGTAGTCGGGGACGTCGTAGGG- 
GTACATGGATGTGAACTCTGTCCTGATAGGTCCCGCTG 

SEQ ID NO:7 CGGAAGAGACGAGAAGCCTACTCCGGAAAAACAGCAGCA- 
AAAGCAGC 



SUBSTITUTE SHEET (RULE 26) 
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TITLE OF THE INVENTION 

.DjAGNQSI^, PROGNOSIS AND_ TREATMENT OF 
TRimJCLE OIIDE REPEAT -ASS OCIATED DIS EASES AND JbjlRANUCLEAR 
JNCLy SIONS-ASSOCIATE DDISEASES 

5 FIELD OF THE INVENTION 

The present invention relates to neurodegenerative disorders. 
More particularly, the invention relates to Machado-Joseph disease (MJD). The 
present invention also relates to trinucleotide repeat expansions and more 
particularly to CAG repeats, also termed expansions of a coding CAG repeat 

10 (exp-CAG). and GCG repeats. More particularly, the invention relates to: (1) 
exp-CAG associated diseases; (2) intranuclear inclusions (INI) in patients and 
cellular models of exp-CAG associated diseases; and (3) the elucidation of the 
mechanism responsible for the toxic effects of such repeats. The present 
invention therefore relates to the diagnosis, prognosis and treatment of repeat- 

15 associated diseases and INi-associated diseases as well as to assays for the 
identification of agents which could be used for the treatment of such diseases or 
disorders. 

BACKGROUND OF THE INVENTION 

Coding CAG triplet repeat expansions cause several 
20 neurodegenerative disorders, including Machado-Joseph disease (MJD)*^^ The 
presence of intranuclear filamentous inclusions (INI) containing expanded protein 
in MJD, as well as in other expanded CAG repeat disorders ^xp-CAG), have lead 
to a nuclear toxicity model ^- Similar INI are found in oculopharyngeal 
muscular dystrophy, which is caused by a short expansion of analanine encoding 
25 GCG repeat. According to the present invention, it is proposed that 
transcriptional or translational frameshifts occurring within expanded CAG tracts 
result in the production and accumulation of polyalanine-containing mutant 
proteins. These alanine polymers might deposit in cells forming INI and lead to 
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nuclear toxicity. Support for tliis disease model is provided using lymphoblast 
ceils from MJD patients, as well as in pontine neurons of MJD brain and in//? vitro 
cell culture models of the disease. Evidence that alanine polymers- alone are 
toxic to cells is also provided and strongly suggests that a similar pathogenic 

5 mechanism underlies the other CAG repeat disorders. 

Indeed, recent reviews describe a significant number of 
neurodegenerative diseases, including Huntington disease as well as spinal 
cerebellar ataxias which are caused by CAG repeat expansions. Of note, CAG 
repeats code for polyglutamine in the protein containing same. It is commonly 

1 0 believed that these polyglutamine stretches in proteins are toxic to cells, and 
these repeats are also termed CAG/polyglutamine repeats (Iveret al. 1999, 
Nature Medicine 5:383-384). These diseases, also termed polyglutamine 
diseases, are thought to occur by "a gain function mechanism". Unfortunately, 
the mechanism explaining toxicity of the polyglutamine diseases, apparently 

15 through an aggregation in nuclear inclusions, has yet to be provided, although 
transgenic mice bearing a polyglutamine repeat in a recombinant protein were 
shown to display intranuclear polyglutamine inclusions (Hardy et al. 1998, 
Science 282 :1076-1079). Of relevance, although the pathogenic effect of these 
inclusion bodies is not clearty understood, it is recognized that numerous types 

20 of genes can contain these so-called GAG repeats, and that while these repeats 
are linked to the disease, the genes containing these repeats are "largely 
irrelevant to the disease process" (Hardy et al. ipid., supra). 

INI are also found in oculopharyngeal muscular dystrophy 
(OPMD)''. which is caused by short expansions of a poiyalanine (poiyAla) 

25 encoding GCG tract in the PABP2 gene" (also see PCT/CA98/01133). In 
contrast to the GAG repeat disorders, where expansions frequently involve the 
addition of 20 or more codons, very small GCG expansions (exp-GGG) of 2 to 7 
additional codons are seen in dominant OPMD, suggesting that poiyalanine tracts 
are prone to aggregation and may be very toxic". This contention is supported 
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by the observation thatpolyAla peptides containing more than 9 alanines (Ala) in 
a row fomi p-pleated sheet fibrillar macromolecules spontaneously in vrfro", 
which in turn are extremely resistant to chemical and enzymatic degradation". 

Short trinucleotide repeat expansions causing a human 
5 disease have been first described in PCT application number PCT/CA98/01 1 3, 
of Rouleau et al. . which teaches that the addition of only two GCG repeats (which 
ericode the amino acid alanine [ALA]) is sufficient to cause dominant OPMD. 
OPMD expansions do not share the cardinal features of "dynamic mutations'. The 
GCG expansions are not only short they are also meiotically quite stable. 
1 0 Furthermore, there is a clear cut-off between the normal and abnormal alleles: a 
single GCG expansion causing a recessive phenotype. The PAB II (GGQ7 allele 
was thus the first example of a relatively frequent allele which can act as either 
a modifier of a dominant phenotype or as a recessive mutation. A dosage effect 
of these repeats is also disclosed in PCT/CA98/01 1 33. since a patient having an 
15 expansion in the poly alanine tract of the HOXD13 protein (Akarsu,. et a!.. 1996, 
Hum. Mol. Genet. 5: 945-952) has more severe deformities. A duplication causing 
a similar polyalanine expansion in the subunit 1 gene of the core-binding 
transcription factor CBF(1 ) has also been found to cause dominant cleido-cranial 
dysplasia (Mundlos, S. et al.,1997. Cell 89:773-779). Of note, however, the 
20 mutations in these two rare diseases are not triplet-repeats. They are duplications 
of "cryptic repeats" composed of mixed synonymous codons and are thought to 
result from unequal crossing over (Warren, 1997. Science 275 : 408-409). In the 
case of OPMD. slippage during replication causing a reiteration of the GCG 
codoh is a more likely mechanism (Wells, 1996, J. Biol. Chem. 271: 2875-2878). 
25 Different observations converge to suggest that a gain of 

function of PAB !l may cause the accumulation of nuclear filaments observed in 
OPMD (Tome et al.. 1980, Acta Neuropath. 49: 85-87). PAB 11 is found mostly in 
dimeric and oligomeric form (Nemeth, et al..1995. Nucleic Acids Res.23: 4034- 
4041). It is possible that the polyalanine tract plays a role in polymerization. 
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Poiyalanine stretches have been found in many other nuclear proteins such as 
the HOX proteins, but their function is still unknown (Davies. et al., 1997. CellGO: 
537-548). Alanine is a highly hydrophobic amino acid present in the cores of 
proteins. In dragline spider silk, poiyalanine stretches are thought to form B-sheet 
5 structures important in ensuring the fibers' strength (Simmons, A.H. et al.. 
Science 271:4 -87 (1996)). Poiyalanine oligomers have also been shown to l>e 
extremely resistant to chemical denaturation and enzymatic degradation (Forood 
et al. 1995, Bioch. and Biophy. Res. Com. 211:7-13). Their role in the disease 
process, however, has still not been clearly identified. The more severe 

10 phenotypes observed in homozygotes for the (GCG)9 mutations and compound 
heterozygotes for the (GCG)9 mutation and (GCG)7 allele may correspond to the 
fact that in these cases PAB II oligomers are composed only of mutated proteins. 
The ensuing faster filament accumulation could cause accelerated cell death.The 
recent description of nuclear filament inclusions in Huntington's disease, raises 

15 the possibility that "nuclear toxicity" caused by the accumulation of mutated 
homopolymeric domains is involved in the molecular pathophysiology of other 
triplet repeat diseases (Davies. S.W. et al.. Cell 90:537-548 (1997); S<^erzinger, 
E. et al.. Cell 90:549-558 (1997); DiFiglia. M. et al., Science 277:1990-1993 
(1997)). Additional data, including immunocytochemical and expression studies 

20 will have to be provided to test this pathophysiological hypothesis and provide 
some insight into why certain muscle groups are more affected, while all tissues 
express PAB II. 

There thus remains a need to elucidate the mechanism by 
which GCG and CAG expansions are toxic to cells. There also remains a need 
25 to provide diagnosis and/or prognosis and/or treatment tools for diseases 
associated with GCG or CAG repeats. 

The present invention seeks to meet these and other needs. 
The present description refers to a number of documents, the 
content of which is herein incorporated by reference in their entirety. 
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SUMMARY OF THE iNVENTtON 

The invention therefore concerns the identification of the 
mechanistic action of expanded CAG tracts in cell pathogenesis, cell death or 
disease. More specifically, the invention relates to the identification that 
5 mutational events within these CAG enable them to encodepolyalanine stretches 
which accumulate in intranuclear filamentous inclusions and somehow trigger 
toxicity in cells. In particular, the present invention relates totranslational and/or 
transcriptional frameshifl events occurring within the CAG tracts, thereby resulting 
in the production and accumulation of polyaianine-containing mutant proteins. 

10 The present invention, in addition, relates to a formal 

identification of the toxic effects of polyalanine stretches on nuclear toxicity. The 
Applicant is thus the first to have demonstrated that CAG expansions can give 
rise to production of mutant proteins containing polyalanine stretches. The 
Applicant is also the first to have demonstrated that polyalanine stretches in 

15 proteins are indeed toxic to cells. 

The instant invention also relates to gene replacement 
technologies aimed at deleting a repeat-containing protein giving rise to a mutant 
protein by a normal corresponding protein (i.e. lacking the repeat or having a 
smaller repeat). 

20 The present invention also provides the means to determine 

a predisposition to developing a disease or condition associated with the 
expression of a poly amino acid-containing protein such as polyaianine-containing 
proteins. This determination could thus enable a better prognosis of the disease 
and condition and enable a determination of the best treatment or prevention of 

25 the disease or condition. 

Another aim of the present invention is thus to provide means 
to screen humans (and more broadly animals) to identify those that might have 
a predisposition to developing a disease associated with the expression of genes 
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that lead to polyalanine-containing proteins such as CAG, GCG-repeat containing 
genes. 

it is thus an aim of the present invention to provide the means 
to better manage such disease prevention and intervention programs. 
5 Before the present invention. CAG repeat expansions (also 

known as CAG repeats or CAG/poiyglutamine stretches) were recognized as a 
common denominator in numerous neurological diseases and were thought to 
code for polyglutamine stretches in the mutant protein. These polyglutamine 
stretches were thought to confer "a gain of toxic property to these proteins" 

10 (Iver et al. supra) by a mechanism that was not understood (Hardy et al. supr^. 
indeed, CAG tract toxicity was also referred to as polyglutamine diseases. The 
present invention demonstrates that these CAG repeats actually encode 
polyalanine stretches. 

Prior to the present invention, the identification of CAG 

1 5 repeats in a gene correlating with a neurological disease led to the classification 
of such disease in the polyglutamine diseases (Hardy et al. supra). The present 
invention now demonstrates that polyalanine stretches, as opposed to 
polyglutamine stretches, are responsible for the diseases. 

In view of the atiove, the present invention opens the way to 

20 numerous methods of diagnosing, prognosing or treating CAG repeat-dependent 
diseases or conditions. In addition, it provides means to diagnose, prognose and 
treat diseases or conditions associated with polyalanine-containing proteins (i.e. 
GCG repeats). Non-limiting examples thereof comprise methods using ligands 
(i.e. polyclonal and monoclonal antibodies), nucleic acid sequences, restriction 

25 length polymorphisms (RFLPs) and the like. 

While the instant invention is more particularly directed to 
neurological diseases, as is demonstrated with Machado-Joseph disease, it 
should be understood that the present invention should not be so limited. Indeed, 
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polyaianine stretches have been shown to be responsible for non-neurological 
diseases such as. for example, a muscle disease (OPMD; PCT/CA98/01133). 

In order to better understand the disease process associated 
with the presence of INI in exp-CAG and exp-GCG associated disorders, a direct 
5 assessment of whether polyAla stretches accumulated as Intranuclear protein 
aggregates was carried out. Experiments were thus performed to analyze 
whether rare transcriptional or translational frameshifts in large CAG stretches 
resulting in new reading frames with the fomaation of a hybrid protein containing 
a mixed polyglutamine/poly-alanine tract occurred. Additionally, It was of 
importance to assess whether the resultant polyAla peptides accumulate in nuclei 
where they fonm INI. < 

Surprisingly, it was discovered that an antiserum raised 
against the hypothetical COOH terminus of the predicted polyAla containing 
frameshifted ataxin-3 protein (MJD-Ala) detects the frameshifted species In 
lymphoblastoid cells from MJD patients with large CAG tracts. This antiserum 
detects these polyAla tracts as insoluble macromolecules on Western blots, and 
as intranuclear inclusions by immunocytochemistry. Frameshifted species were 
also present in INI in pontine neurons of MJD brain. Transfection of COS-7 cells 
with full-length MJD-1 fused to the enhanced green fluorescence protein (EGFF) 
gene in the alternative polyAla reading frame also leads to EGFP accumulation 
preferentially when the CAG tract is expanded. 

Of interest, it is also demonstrated that long CAG repeats are 
prone to frameshifts, which result in accumulation of the predicted polyAla- 
containing Inclusions. Transfection of COS-7 cells with mutated MJD-1 
constructs containing alanine-coding GCA stretches results in a more severe 
phenotype when compared to their CAG counterparts. Furthermore, transfected 
polyAla-encoding GCA stretches alone are toxic and form aggregates. 

How these accumulations lead to cell death still needs to be 

elucidated. 
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A frameshift error occurring within a CAG tract thus results in 
the alternate alanine-encoding GCA frame. Many authors have reported 
frameshifts at the level of transcription or translation. The observation of 
transcriptional errors of the p-amyloid precursor protein and ubiqurtin-B in 
5 Alzheimer's disease'"*, and apoB86", supports the existence of such errors, and 
their role in disease pathogenesis. Translational errors have also fc>een shown to 
occur and may be the basis for the formation of frameshifted proteins'*. 

In accordance with one embodiment of the present invention, 
there is therefore provided a method for the diagnosis of a disease associated 

10 with protein accumulation in intranuclear inclusions, which comprises obtaining 
a sample of a patient and determining a presence of the protein accumulation in 
the intranuclear inclusions, wherein this protein accumulation is indicative of a 
disease related thereto. 

In accordance with another embodiment of the present 

1 5 invention, there is also provided a method for the screening of agents which can 
modulate at least one of (1) a polyamino acid-containing protein expression; (2) 
accumulation of polyamino acid-containing proteins in intranuclear inclusions; and 
(3) toxicity to cells, which comprises: a) incubating a cell hartwring an expression 
vector of the present invention, comprising a repeat domain which can give rise 

20 to a polyamino acid-containing protein associated with a disease or condition in 
an animal, with a compound; and b) assessing one of (1) an expression of the 
polyamino acid-containing protein; (2) accumulation of the polyamino acid- 
containing protein; and (3) toxicity to cells; whereby a modulator is selected when 
the agent significantly modulates one of the expression, accumulation and 

25 toxicity, as compared to a control agent. 

The instant invention also relates to GCG repeatis encoding 
polyalanine stretches and their association with protein accumulation in a cell 
nudeus, swallowing difficulty and/or ptosis in a patient. In accordance with 
another embodiment of the present invention, there is provided a method for the 
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diagnosis or prognosis of a disease associated with protein accumulation in a cell 
nucleus, and/pr swallowing difficulty and/or ptosis in a human patient, which 
comprises: 

a) obtaining a sample of a patient; and 

5 b) determining the extract of the polyalanine stretch in an alanine 

stretch-containing protein having the amino acid sequence: 

Met (Ala)e*„Ala. 
wherein n is selected from 0 to 7, and 

whereby an n equal to 1 to 7 is indicative of a disease related 
1 0 with the protein accumulation in the nucleus, and/or a swallowing difficulty and/or 
ptosis in the patient. In a related aspect of the present invention, there is provided 
a human PAB II protein comprising a polymorphic GCG repeat encoding a 
polyalanine stretch having the sequence 
Met (Ala)6*n Ala. 

15 wherein n is 0. and wherein the sequence is indicative of a non-disease 
phenotype associated with protein accumulation in a cell, nucleus, swallowing 
difficulty, and/or ptosis in a human patient. 

In order to provide a clear and consistent understanding of 
terms used in the present description, a number of definitions are provided 

20 hereinbelow. 

Nucleotide sequences are presented herein by single strand, 
in the 5' to 3' direction, from left to right, using the one letter nucleotide symbols 
as commonly used in the art and in accordance with the recommendations of the 
lUPAC-lUB Biochemical Nomenclature Commission. 
25 Unless defined othenn/ise. the scientific and technological 

terms and nomenclature used herein have the same meaning as commonly 
understood by a person of ordinary skill to which this invention pertains. 
Generally, the procedures for cell cultures, infection, molecular biology methods 
and the like are common methods used in the art Such standard techniques can 
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be found in reference manuals such as for example Sambrook et al. (1989. 
Molecular Cloning - A Laboratory Manual, Cold Spring Harbor Laboratories) and 
Ausubel et al. (1994. Current Protocols in Molecular Biology. Wiley, New York). 

The present description refers to a number of routinely used 
5 recombinant DNA (rDNA) technology terms. Nevertheless, definitions of selected 
examples of such rDNA terms are provided for clarity and consistency. 

As used herein, "nudeic acid molecule", refers to a polymer of 
nucleotides. Non-limiting examples thereof include DNA (i.e, genomic DNA, 
cDNA) and RNA molecules (i.e. mRNA). The nucleic acid molecule can be 
1 0 obtained by cloning techniques or synthesized. DNA can be double-stranded or 
single-stranded (coding strand or non-coding strand [antisense]). 

The term "recombinant DNA" as known in the art refers to a 
DNA molecule resulting from the joining of DNA segments. This is often referred 
to as genetic engineering. 
15 The term "DNA segment", is used herein, to refer to a DNA 

molecule comprising a linear stretch or sequence of nucleotides. This sequence 
when read in accordance with the genetic code, can encode a linear stretch or 
sequence of amino acids which can be referred to as a polypeptide, protein, 
protein fragment and the like. 
20 The terminology "amplification pair" refers herein to a pair of 

oligonucleotides (oligos) of the present invention, which are selected to be used 
together In amplifying a selected nucleic acid sequence by one of a number of 
types of amplification processes, preferably a polymerase chain reaction. Other 
types of amplification processes include Hgase chain reaction, strand 
25 displacement amplification, or nucleic acid sequence-based amplification, as 
explained in greater detail below. As commonly known in the art. theoligos are 
designed to bind to a complementary sequence under selected conditions. 

The nucleic acid (i.e. DNA or RNA) for practicing the present 
invention may be obtained according to well known methods. 
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As used herein, the term "physiologically relevant" is meant to 
descrit>e a frameshifting event which can result in the production of a toxic protein 
in vivo. 

Oligonucleotide probes or primers of the present invention 
5 may be of any suitable length, depending on the particular assay format and the 
particular needs and targeted sequences employed. In general, the 
oligonucleotide probes or primers are at least 12 nucleotides In length, preferably 
kjetween 15 and 24 molecules, and they may be adapted to be especially suited 
to a chosen nucleic acid amplification system. As commonly known in the art, the 
10 oligonucleotide probes and primers can be designed by taking into consideration 
the melting point of hydrizidation thereof with its targeted sequence (see below 
and in Sambrook et al.. 1989, Molecular Cloning - A Laboratory Manual, 2nd 
Edition, CSH Laboratories; Ausubel et al., 1989, in Current Protocols in Molecular 
Biology, John Wiley & Sons Inc.. N.Y.). 
1 5 The terms "oligonucleotide" or "DNA" molecule or sequence 

refers to a molecule comprised of thedeoxyribonucleotides adenine (A), guanine 
(G), thymine (T) and/or cytosine (C), in a double-stranded or single-stranded 
form. The term "oligonucleotide" or "DNA" can be found in linear DNA molecules 
or fragments, viruses, plasmids. vectors, chromosomes or synthetically derived 
20 DNA. As used herein, particular DNA sequences may be described according to 
the normal convention of giving only the sequence in the 5' to 3' direction. 

"Nucleic acid hybridization" refers generally to the 
hybridization of two single-stranded nucleic acid molecules having 
complementary base sequences, which under appropriate conditions will form a 
25 thermodynamically favored double-stranded structure. Examples of hybridization 
conditions can be found in the two laboratory manuals referred above ^ambrook 
et al., 1989. supra and Ausubel et al., 1989, supra) and are commonly known in 
the art. In the case of a hybridization to a nitrocellulose filter, as for example in the 
well known Southern blotting procedure, a nitrocellulose filter can be incubated 
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overnight at 65''C with a iabieled probe in a solution containing 50%formamide, 
high salt (5 x SSC or 5 x SSPE), 5 xDenhardt's solution, 1% SDS. and 100 pg/ml 
denatured canier DNA (i e. salmon sperm DNA). The non-specifically binding 
probe can then be washed off the filter by several washes In 0.2 x SSC/0.1% SDS 
5 at a temperature which is selected in view of the desired strinigency: room 
temperature (low stringency), 42°C (moderate stringency) or 65°C (high 
stringency). The selected temperature is based on the melting temperature (Tm) 
of the DMA hybrid (Sambrook et al. 1 989, supra). Of course. RNA-DNA hybrids 
can also be formed and detected. In such cases, the conditions of hybridization 
1 0 and washing can be adapted according to well known methods by the person of 
ordinary skill. Stringent conditions will be preferably used (Sambrook et al..1989, 
supra). 

Probes or primers of the invention can be utilized with 
naturally occuning sugar-phosphate backbones as well as modified backbones 

15 including phosphorothioates, dithionates, alkyi phosphonates and a-nucleotides 
and the like. Modified sugar-phosphate backt)ones are generally taught by Miller. 
1988. Ann. Reports Med. Chem. 23:295 and Moran et al.. 1987, Nucleic acid 
molecule. Acids Res., 14:5019. Probes or primers of the invention can be 
constructed of either ribonucleic acid (RNA) or deoxyribonucleic add (DNA), and 

20 preferably of DNA. 

The types of detection methods in which probes can be used 
include Southern blots (DNA detection), dot or slot blots (DNA. RNA), and 
Northern blots (RNA detection). Although less preferred, labeled proteins could 
also be used to detect a particular nucleic acid sequence to which it binds. Other 
25 detection methods include kits containing probes on a dipstick setup and the like. 

Altiiough the present invention is not specifically dependent 
on the use of a label for the detection of a particular nucleic acid sequence, such 
a label might be beneficial, by increasing the sensitivity of the detection. 
Furthermore, it enables automation (the same can also be said of detection of 
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proteins using ligands such as antibodies). Probes can be labeled according to 
numerous well known methods (Sambrook et al., 1989, supra). Non-limiting 
examples of detectable markers Include ligands. fluorophores, chemiluminescent 
agents, enzymes, and antibodies. Other detectable markers for use with probes, 
5 which can enable an increase in sensitivity of the method of the invention, include 
biotin and radionucleotides. It will become evident to the person of ordinary skill 
that the choice of a particular label dictates the manner in which it is bound to the 
probe. 

As commonly known, radioactive nucleotides can be 

10 incorporated into probes of the invention by several methods. Non-limiting 
examples thereof include kinasing the 5' ends of the probes using gamma '^P 
ATP and polynucleotide kinase, using the Klenow fragment of Pol I of E. co// in 
the presence of radioactive dNTP (i.e. uniformly lal)eled DNA prot>e using random 
oligonucleotide primers in low-melt gels), using the SP6/T7 system to transcribe 

15 a DNA segment in the presence of one or more radioactive NTP, and the like. 

As used herein, "oligonucleotides" or "oligos" define a 
molecule having two or more nucleotides (ribo or deoxyribonucleotides). The size 
of the oligo will be dictated by the particular situation and ultimately on the 
particular use thereof and adapted accordingly by the person of ordinary skill. An 

20 oligonucleotide can be synthetised chemically or derived by cloning according to 
well known methods. 

As used herein, a "primer' defines an oligonucleotide which is 
capable of annealing to a target sequence, thereby creating a double stranded 
region which can serve as an initiation point for DNA synthesis under suitable 

25 conditions. 

Amplification of a selected, or target, nucleic acid sequence 
may be carried out by a number of suitable methods. See generally Kwoh et al., 
1990, Am. Biotechnol. Lab. 8:14-25. Numerous amplification techniques have 
been described and can be readily adapted to suit particular needs of a person 
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of ordinary skill. Non-limiting examples of amplification techniques include 
polymerase chain reaction (PCR). ligase chain reaction (LCR), strand 
displacement amplification (SDA), transcription-based amplification, the Qp 
replicase system and NASBA (Kwoh et al.. 1989. Proc. Natl. Acad.Sci. USA 86. 
5 11 73-1 1 77; Uzardi et al. , 1 988, BioTechnology 6: 1 1 97-1 202; Maiek et al., 1 994, 
Methods Mol. Biol., 28:253-260; and Sambrook et al, 1989. supra). Preferably, 
amplification will be carried out using PCR. 

Polymerase chain reaction (PCR) is earned out in accordance 
with known techniques. See. e.g., U.S. Pat. Nos. 4.683.195; 4,683,202; 

10 4.800,159; and 4,965.188 (the disclosures of all three U.S. Patent are 
incorporated herein by reference). In general. PCR involves, a treatment of a 
nucleic acid sample (e.g.. in the presence of a heat stable DNA polymerase) 
under hybridizing conditions, with one oligonucleotide primer for each strand of 
the specific sequence to h>e detected. An extension product of each primer which 

15 is synthesized is complementary to each of the two nucleic acid strands, with the 
primers sufficiently complementary to each strand of the specific sequence to 
hybnidize therewith. The extension product synthesized from each primer can also 
serve as a template for further synthesis of extension products using the same 
primers. Following a sufficient number of rounds of synthesis of extension 

20 products, the sample is analysed to assess whether the sequence or sequences 
to be detected are present. Detection of the amplified sequence may be carried 
out by visualization following EtBr staining of the DNA following gelelectrophbres, 
or using a detectable label in accordance with known techniques, and the like. For 
a review on PCR techniques (see PCR Protocols, A Guide to Methods and 

25 Amplifications, Michael et al. Eds. Acad. Press, 1990). 

Ligase chain reaction (LCR) is carried out In accordance with 
known techniques (Weiss. 1991, Science 254 :1292). Adaptation of the protocol 
to meet the desired needs can be carried out by a person of ordinary skill. Strand 
displacement amplification (SDA) is also carried out in accordance with known 



wo 01/18544 



PCT/CA0Oy01052 



15 



techniques or adaptations thereof to meet the particular needs (Walker et ai., 
1992, Proc. Natl. Acad. Sci. USA 89:392-396; and ibid.. 1992. Nucleic Acids Res. 
20:1691-1696). 

As used herein, the tenn "gene" is well known in the art and 
5 relates to a nucleic acid sequence defining a single protein or polypeptide. A 
"structural gene" defines a DNA sequence which is transcribed into RNA and 
translated into a protein having a specific amino acid sequence thereby giving rise 
the a specific polypeptide or protein. It will be readily recognized by the person of 
ordinary skill, that the nucleic acid sequence of the present invention can be 
10 incorporated into anyone of numerous established kit formats which are well 
known in the art. 

A "heterologous" (i.e. a heterologous gene) region of a DNA 
molecule is a subsegment segment of DNA within a larger segment that is not 
found in association therewith in nature. The term lieterologous" can be similarly 

15 used to define two polypeptidic segments not joined together in nature. Non- 
limiting examples of heterologous genes include reporter genes such as green 
fluorescence protein, luciferase. chloramphenicol acetyl transferase, p- 
galactosidase, and the like which can be juxtaposed or Joined to heterologous 
control regions or to heterologous polypeptides. 

20 The term "vector" is commonly known in the art and defines a 

plasmid DNA, phage DNA, viral DNA and the like, which can serve as a DNA 
vehicle into which DNA of the present invention can be cloned. Numerous types 
of vectors exist and are well known in the art. 

The term "expression" defines the process by which a gene is 

25 transcribed into mRNA (transcription), the mRNA is then being translated 
(translation) into one polypeptide (or protein) or more. 

The terminology "expression vector" defines a vector or 
vehicle as descriljed above but designed to eniable the expression of an inserted 
sequence following transformation into a host. The cloned gene (inserted 
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sequence) is usually placed under the control of control element sequences such 
as promoter sequences. The placing of a cloned gene under such control 
sequences is often refered to as being operably linked to control elements or 
sequences. 

5 Operably linked sequences may also include two segments 

that are transcribed onto the same RNA transcript. Thus, two sequences, such 
as a promofer and a "reporter sequence" are operably linked if transcription 
commendng in the promoter will produce an RNA transcript of the reporter 
sequence, in order to t>e "operaNy linked" it is not necessary that two sequences 

10 t)e immediately adjacent to one another. 

Expression control sequences will vary depending on whether 
the vector is designed to express the operably linked gene in a prokaryotic or 
eukaryotic host or both (shuttle vectors) and can additionally contain 
transcriptional elements such as enhancer elements, termination sequences. 

15 tissue-specificity elements, and/or translationai initiation and termination sites. 

Prokaryotic expressions are useful for the preparation of large 
quantities of the protein encoded by the DNA sequence of interest. This protein 
can be purified according to standard protocols that take advantage of the 
intrinsic properties thereof, such as size and charge (i.e. SDS gel electrophoresis, 

20 gel filtration, centrifugation. ion exchange chromatography...). In addition, the 
protein of interest can be purified via affinity chromatography using polyclonal or 
monoclonal antibodies. The purified protein can be used for diagnostic or 
therapeutic applications. 

The DNA construct can be a vector comprising a promoter 

25 that is operably linked to an oligonucleotide sequence of the present invention, 
which is in turn, operably linked to a heterologous gene, such as the gene for the 
luciferase reporter molecule. "Promoter" refers to a DNA regulatory region 
capable of binding directly or indirectly to RNA polymerase in a cell and initiating 
transcription of a downstream (3' direction) coding sequence. For purposes of 
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the present invention, the promoter is bound at its 3* terminus by the transcription 
initiation site and extends upstream (5' direction) to include the minimum number 
of bases or elements necessary to initiate transcription at levels detectable above 
background. Within the promoter will be found a transcription initiation site 
5 (conveniently defined by mapping with SI nuclease), as well as protein binding 
domains (consensus sequences) responsible for the binding of RNA polymerase. 
Eukaryotic promoters will often, but not always, contain "TATA" boses and 
"CCAT" boxes. Prokaryotic promoters contain Shine-Dalgamo sequences in 
addition to the -10 and -35 consensus sequences. 

10 As used herein, the designation "functional derivative' 

denotes, in the context of a functional derivative of a sequence whether an 
nucleic acid or amino acid sequence, a molecule that retains a biological activity 
(either function or structural) that is substantially similar to that of the original 
sequence. This functional derivative or equivalent may be a natural derivative or 

15 may be prepared synthetically. Such derivatives include amino acid sequences 
having substitutions, deletions, or additions of one or more amino acids, provided 
that the biological activity of the protein is conserved. The same applies to 
derivatives of nucleic acid sequences which can have substitutions, deletions, or 
additions of one or more nucleotides, provided that the biological activity of the 

20 sequence is generally maintained. When relating to a protein sequence, the 
substituting amino acid as chemico-physical properties which are similar to that 
of the substituted amino acid. The similar chemico-physical properties include, 
similarities in charge, bulkiness, hydrophobicity. hydrophylicity and the like. The 
term "functional derivatives" is intended to include "fragments", "segments", 

25 "variants", "analogs ' or "chemical derivatives ' of the subject matter of the present 
invention, 

Thus, the term "variant" refers herein to a protein or nucleic 
acid molecule which is substantially similar in structure and biological activity to 
the protein or nucleic acid of the present invention. 
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The functional derivatives of the present invention can be 
synthesized chemically or produced through recombinant DNA technology. All 
these methods are well known in the art. 

As used herein, "chemical derivatives" is meant to cover 

5 additional chemical moieties not normally part of the subject matter of the 
invention. Such moieties could affect the physico-chemical characteristic of the 
derivative (i.e. solubility, absorption, half life and the like, decrease of toxicity). 
Such moieties are examplified in Remington's Pharmaceutical Sdences (1980). 
Methods of coupling these chemical-physical moieties to a polypeptide are well 

10 known in the art. 

The term "allele" defines an alternative form of a gene which 
occupies a given locus on a chromosome. 

As commonly known, a "mutation' is a detectable change in 
the genetic material which can be transmitted to a daughter cell. As well known, 

15 a mutation can be, for example, a detectable change in one or more 
deoxyribonucleotide. For example, nucleotides can be added, deleted, substituted 
for, inverted, or transposed to a new position. Spontaneous mutations and 
experimentally induced mutations exist. A mutant polypeptide can be encoded 
from a mutant nucleic acid molecule. In addition, mutant proteins can be 

20 produced through aberrant events during replication, transcription and/or 
translation. Frameshifting (the switching from a particular reading frame to 
another) is such a mechanism that can modify the sequence of the translated 
protein. 

As used herein, the term "purified" refers to a molecule having 
25 been separated from a cellular component. Thus, for example, a "purified protein' 
has been purified to a level not found in nature. A "substantially pure" molecule 
is a molecule that is lacking in all other cellular components. 

As used herein, the terms "molecule", "compound", "agent", or 
"ligand" are used interchangeaibly and broadly to refer to natural, synthetic or 
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semi-synthetic molecules or compounds. The tenn "molecule" therefore denotes 
for example chemicals, macromolecules. cell or tissue extracts (from plants or 
animals) and the like. Non limiting examples of molecules include nucleic acid 
molecules, peptides, antibodies, cartxjhydrates and pharmaceuticaUherapeutical 
agents. The agents can be selected and screened by a variety of means 
including random screening, rational selection and by rational design using for 
example protein orligand modelling methods such as computermodelling. The 
terms "rationally selected" or "rationally designed" ^re meant to define, for 
example, compounds which have t>een chosen based on the configuration of the 
polyalanine domains of the present invention. As will be understood by the 
person of ordinary skill, macromolecules having non-naturally occurring 
modifications are also within the scope of the term "molecule". For example, 
peptidomimetics, well known in the pharmaceutical industry and generally 
referred to as peptide analogs can be generated by modelling as mentioned 
above. Similarly, in one embodiment, the polypeptides of the present invention 
can be modified to enhance or decrease their stability. It should be understood 
that in most cases this modification should not alter the biological activity of the 
polyalanine domain (its toxic effect or INI localization property). The molecules 
identified in accordance with the teachings of the present invention have a 
therapeutic value in diseases or conditions in which the physiology or 
homeastasis of the cell and/or tissue is compromised by a production of 
polyalanine-containing proteins or polypeptides. Alternatively, the molecules 
identified in accordance with the teachings of the present invention find utility in 
the development of more efficient molecule to lower and/or abrogate the toxicity 
of such proteins and/or to reduce or eliminate the production of such mutant 
proteins. It will be understood that agents can be screened, in accordance with 
the present invention, with libraries of compounds, using for example automated 
screening methods (e.g. array technologies). 
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The level of gene expression of the reporter gene (e.g. the 
level of luciferase, or p-gal, produced) within the treated cells can be compared 
to that of the reporter gene in the absence of the molecules(s). The difference 
between the levels of gene expression indicates whether the molecule(s) of 
5 interest agonizes the aforementioned interaction. The magnitude of the level of 
reporter gene product expressed (treated vs. untreated cells) provides a relative 
indication of the strength of that molecule(s) as an agonist. The same type of 
approach can also be used in the presence of an antagonist(s). Thus, 
modulators of the production of such proteins can he identified and selected. 
10 Non-limiting examples of modulators in accordance with the present invention 
include frameshift mutants or suppressors, relievers of codon rareness (i.e. 
relieving the limitation of rare codons which favor frameshifting events), agents 
which degrade polyalanine stretches, tRNAs. tRNA suppressors and the like. 
One skilled in the art will realize that the assays to identify compounds that 
1 5 modulate frameshifting of CAG repeats and the like, could l>e carried out using 
other repeats as well as other genes known to promote frameshifting. Such 
genes are known in the art. 

The present invention also provides antisense nucleic acid 
molecules which can l>e used for example to modulate the expression of the 
20 mutant proteins of the present invention. An antisense nucleic acid molecule 
according to the present invention refers to a molecule capable of forming a 
stable duplex or triplex with a portion of its targeted nucleic add sequence (DMA 
or RNA). The use of antisense nucleic acid molecules and the design and 
modification of such molecules is well known in the art as descritied for example 
25 in WO 96/32966, WO 96/11266. WO 94/15646. WO 93/08845 and 
USP 5,593,974. Antisense nucleic acid molecules according to the present 
invention can be derived from the nucleic acid sequences and modified in 
accordance to well known methods. For example, some antisense molecules can 
be designed to be more resistant to degradation to increase their affinity to their 
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targeted sequence, to affect their transport to chosen cell types or cell 
compartments, and/or to enhance their lipid solubility bu using nucleotide analogs 
and/or substituting chosen chemical fragments thereof, as commonly known In 
the art. 

5 Alternatively, an indicator cell in accordance with the present 

invention can be used to identify antagonists. For example, the test molecule or 
molecules are incubated with the host cell in conjunction with one or more 
agonists held at a fixed concentration. An indication and relative strength of the 
antagonistic properties of the molecule(s) can be provided by comparing the level 

10 of gene exprisssion in the indicator cell iri the presence of the agonist, in the 
absence of test molecules vs in ttie presence thereof. Of course, the antagonistic 
effect of a molecule can also be determined in the absence of agonist, simply by 
comparing the level of expression of the reporter gene product in the presence 
and absence of the test molecule(s). 

15 It shall be understood that the "in vivo" experimental model 

can also be used to carry out an "in vitro" assay. For example, cellular extracts 
from the indicator cells can be prepared and used in one of the aforementioned 
"/nwfro" tests. 

As used herein the recitation "indicator cells" refers to, fpr 
20 example, cells that express a fusion protein comprising apolyalanine segment 
(e.g. a "CAG" repeat) and an identifiable or selectable phenotype or characteristic 
which enables an assessment of the level of fusion protein expression (e.g. a 
reporter protein). Such indicator cells can be used in the screening assays of the 
present invention. In certain embodiments, the indicator cells have been 
25 engineered so as to express a chosen derivative, fragment, homotog, or mutant 
of a repeat. It should be understood that the repeats should not be limited to CAG 
repeats. Indeed. GCG repeats can also be used. In addition, the invention should 
not be limited to polyalanine repeats, since the present invention provides for the 
testing of polyserine fragments and other polyaminb acids which could be 
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expressed from a frameshifting event. The cells can be yeast cells or higher 
eukaryotic cells such as mammalian ceils (WO 96/41169). Preferably, the 
indicator cells are higher eukaryotic cells. Non-limiting examples of-such cells 
and vectors are exemplified herein below (i.e. Examples 2-4). In one particular 
5 embodiment, the indicator cell could be used to test a compound or a library 
thereof. 

As exemplified herein below in one embodiment, a 
polyalanine polypeptide segment of the present invention is provkled as a fusion 
protein. The design of constructs therefor and the expression and production of 

1 0 fusion proteins are exemplified herein and are well known in the art (Sambrook. 
et al., 1989, supra; and Ausubel et al., 1994, supra). 

Non limiting examples of such fusion proteins include a 
hemaglutinin fusions (HA) and Gluthione-S-transferase (GST) fusions. In certain 
embodiments. It might be beneficial to introduce a protease cleavage site 

1 5 between the two polypeptide sequences which have been fused. Such protease 
cleavage sites between two heterologously fused polypeptides are well known In 
the art. 

In certain emtx)diments, it might also t>e beneficial to 
introduce a linker (cornmonly known) between the repeat segment of the protein 

20 and the heterologous polypeptide portion (e.g. reporter protein portion). Such 
fusion protein find utility in the assays of the present invention as well as for 
purification purposes, detection purposes and the like. 

For certainty, the sequences and polypeptides useful to 
practice the invention include without being limited thereto mutants, homologs, 

25 subtypes, alleles and the like. It shall be understood that generally, the 
sequences of the present invention should encode a functional (albeit defective) 
repeat domain. U will be clear to the person of ordinary skin that whether a repeat 
domain of the present invention, variant, derivative, or fragment thereof retains 
its function in enabling a concentration of the protein containing same in INI or 
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triggering toxicity in cells or animals, can be readily determined by using the 
teachings and assays of the present invention and the general teachings of the 
art. 

As exemplified herein below, the repeat domains of the 
5 present invention can be modified, for example by in vitro mutagenesis, to dissect 
the structure-function relationship thereof and permit a better design and 
identification of modulating compounds. However, some derivative or analogs 
having lost their biological function may still find utility, for example for raising 
antilxxlies. These antil)0dies could be used for detection or purification purposes. 
10 In addition, these antibodies could also act as competitive or non-competitive 
inhibitor and be found to be modulators of the biological activity of the repeat 
domain. 

A host cell or indicator ceil has been Iransfected" by 
exogenous or heterologous DNA (e.g. a DNA construct) when such DNA has 
15 been introduced inside the cell. The transfecting DNA may or may not be 
integrated (covalently linked) into chromosomal DNA making up the genome of 
the cell. In prokaryotes. yeast, and mammalian cells for example, thetransfecting 
DNA may be maintained on a episomal element such as a plasmid. With respect 
to eukaryotic cells, a stably transfected cell is one in which the transfecting DNA 
20 has become integrated into a chromosome so ttiat it is inherited by daughter cells 
through chromosome replication. This stability is demonstrated by the ability of 
the eukaryotic cell to establish cell lines or clones comprised of a population of 
daughter cells containing the transfecting DNA. Transfection methods are well 
known in the art (Sambrook et al., 1989. supra; Ausubel et al.. 1994 supra). 
25 In general, techniques for preparing antibodies (including 

monoclonal antibodies and hybridomas) and for detecting antigens using 
antitx)dies are well known in the art (Campbell. 1984. In "Monoclonal Antibody 
Technology: Laboratory Techniques in Biochemistry and Molecular Biology", 
Elsevier Science Publisher. Amsterdam. The Netheriands) and in Hariow et al., 
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1988 (in: Antibody- A Laboratory Manual, CSH Laboratories). Tlie present 
invention also provides polyclonal, monoclonal antibodies, or humanized versions 
thereof, chimeric antibodies and the like which are specific to the repeat domains 
of the present invention. 
5 From the specification and appended claims, the term 

therapeutic agent should be taken in a broad sense so as to also include a 
combination of at least two such therapeutic agents. Further, the therapeutic 
agent according to the present invention can be introduced into individuals in a 
number of ways. The therapeutic agent can also be delivered through a vehicle 
1 0 such as a liposome, which can be designed to be targeted to a specific cell type, 
and engineered to be administered through different routes. Having shown that 
a polyalanine segment is toxic to cells, the present invention provides the means 
to trigger toxicity in cells by expressing thereinto or delivering thereto a 
pblydlanine-containing protein (or polyalanine-encoding nucleic acid), in 
15 accordance with known methods, a chosen cell population could lie targeted. 

For administration to humans, the prescribing medical 
professional will ultimately determine the appropriate form and dosage for a given 
patient, and this can be expected to vary according to the chosen therapeutic 
regimen (i.e. pNA construct, protein, cells), the response and condition of the 
20 patient as well as the severity of the disease. 

Composition within the scope of the present invention should 
contain the active agent (i.e. fusion protein, nucleic acid, and molecule) in an 
amount effective to achieve the desired therapeutic effect while avoiding adverse 
side effects. Typically, the nucleic acids, fusion proteins and molecules in 
25 accordance with the present invention can be administered to mammals (i.e. 
humans) in doses ranging from 0.005 to 1 mg per kg of body weight per day of 
the mammal which is treated. Pharmaceutically acceptable preparations and 
salts of the active agent are within the scope of the present inviention and are well 
known in the art (Remington's Pharmaceutical Science. 16th Ed , Mack Ed.). For 
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the administration of polypeptides, antagonists, agonists and the like, the amount 
administered should be chosen so as to avoid adverse side effects. The dosage 
will be adapted by the clinician in accordance with conventional factors such as 
the extent of the disease and different parameters from the patient. Typically, 
5 0.001 to 50 mg/kg/day will be administered to the mammal. 

The present invention also relates to a kit for diagnosing a 
disease or condition associated with the expression of a repeat domain, encoding 
for example a polyalanine stretch, or a predisposition to contracting same 
comprising a nucleic acid, a protein or a iigand in accordance with the present 

10 invention. For example, a compartmentalized kit in accordance with the present 
invention includes any kit in which reagents are contained in separate containers. 
Such containers include small glass containers, plastic containers or strips of 
plastic or paper. Such containers allow the efficient transfer of reagents from one 
compartment to another compartment such that the samples and reagents are not 

1 5 cross-contaminated and the agents or solutions of each container can b>e added 
in a quantitative fashion from one compartment to another. Such containers will 
include a container which will accept the test sample (DNA protein or cells), a 
container which contains the primers used in the assay, containers which contain 
enzymes, containers which contain wash reagents, and containers which contain 

20 the reagents used to detect the extension products. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Having thus generally described the invention, reference will 
now be made to the accompanying drawings, showing by way of illustration a 
preferred embodiment thereof, and in which: 
25 Figure 1 shows the Western blot analysis of lymphoblastoid 

cells from controls and MJD patients, a. Schematic representation of the MJD- 
Ala protein that results from a frameshift in the CAG tract showing the new C- 
terminus (italicized; used to raise the FS1 and FS2 antibodies). Western blots of 
two control lymphoblastoid cell lines (cLCL) and four MJD lymphoblastoid cell 
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lines (MJDLCL) immunoprobed with FS1 (b), anti-ataxin-3 (c), 1C2 (d) and FS1 
pre-immune semm (e). Arrow indicates the threshold between stacking and 
resolving portions of the gel. Panels b-d represent serial probing of a single 
membrane. 

5 Figure 2 shows the immunocytochemical detection of 

intranuclear deposits in lymphoblastoid cells. Immunocytochemistry of control 
LCL versus MJD LCL: absence of INI in control LCL probed with FS1 (a), and 
anti-ubiquitin (c). and detection of INI in MJD LCL with FS1 (b) and antiiibiquitin 
(d). e, Immuno detection of MJD LCL with FS1 pre-immune serum. For all 
10 panels, the magnification before publication is 400x (left) and lOOOx (right). 
These results have been replicated in three separate experiments. 

Figure 3 shows the immunohistochemical detection of INI in 
MJD pontine neurons. Immunoprobing with FS1 antiserum in MJD pons (a) and 
control pons (b); immunoprobing with anti-ubiquitin in MJD pons (c) and control 
15 pons (d). INI in pontine neurons are indicated by arrowheads. Double labeling 
immunofluorescence analysis of MJD pons showing ubiquitin-labeled INI (e, h) 
and FS1 -labeled INI (f, i). and the composite image of bothlabelings (g, j). For all 
panels, the magnification before publication is 1000x. before reproduction. 

Figure 4 shows the constructs used in the transfeclion 
20 experiments. All constructs, with the exception of pMJD1 1 , represent fulHength 
MJD-1. Solid black tioxes indicate the repeat portion of the constructs. 
Staggered ends indicate that EGFP will only be expressed if aframesh'rfl occurs. 
Encircled blown up detail of pMJD1 is also present in pMJD2 and pMJD3; details 
of pMJD5 are present in pMJD6; details of pMJD7 are present in pMJD8 and 
25 detailsof pMJD9arethesameaspMJD10. 

Figure 5 shows the transfection experiments with different 
MJD/EGFP constructs, a, DNA sequence of the clones with EGFP out of frame 
(pMJDI. pMJD2 and pMJD3) and b, with EGFP inglutamine frame (pMJD4), both 
(a, b) showing the predicted amino acid sequence, c-e. Fluorescence at 72 hours 
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of COS-7 cells transfected with pMJD1 (c). pMJD2 (d) and pMJD3 (e) where the 
MJD(CAG)n/EGFP fusion protein is translated only when frameshifts to GCA- 
polyAla occur, f. COS-7 cells transfected with pMJD4 where EGFP is in frame 
with CAG-Gln. g. COS-7 cells transfected with the vector pEGFP-NI. h* 
5 perinuclear fluorescent aggregates observed in cells transfected with pMJD3. 
Pictures of sections c, d, e, f and g were taken at 25 x magnification for a fixed 
exposure time of 60 seconds before reproduction; picture shown in h, is at 400 
X before reproduction, i-k. Western blots of protein isolated from cells shown in 
c. d. e, f. g and from mock-transfected cells immunoprobed with 1C2 (i), anti- 

10 ataxin-3 (j) and anti-HA (k). Arrows on the. right of panel j indicate the proteins 
detected. Arrowhead on panel k indicates threshold between stacking and 
resolving portions of the gel. Rpts stands for repeats. 

Figure 6 shows the time-course immunocytochemical analysis 
of COS-7 cells transfected with constructs encoding ataxin-3 with either apolyAia 

15 or a polyGIn tract. Immunoprobing with anti-HA antibody at time-points 8 hours: 
(a) pMJD7. (b) pMJD9. (c) pMJD8, (d) pMJDIO and (e) vector pEGFP-N1 alone; 
20 hours: (f) pMJD7. (g) pMJD9, (h) pMJD8. « pMJDIO and (j) vector pEGFP-N1 
alone; 24 hours: (k) pMJD7. (I) pMJD9. (m) pMJD8. (n) pMJDIO and (o) vector 
pEGFP-NI alone; 48 hours: (p) pMJD7. (q) pMJDQ. (r) pMJDB. (s) pMJDIO and 

20 (t) vector pEGFP-N1 alone. For all panels pictures were taken at 1000x 
magnification, before reproduction. 

Figure 7 shows the western analysis of transfected COS-7 
cells (in figure 5). Blots were probed with (a) anti-HA, (b) 1C2. Arrow indicates 
threshold between stacking and resolving portions of the gel. Cells were 

25 harvested at 72 hours after transfection. 

Figure 8 shows the immunocytochemical analysis of COS-7 
cells transfected with a truncated polyAla-encoding construct. Immunoprobing 
with anti-HA antibody in: (a) cells transfected with 42A and (b) mockfransfected 
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cells. For ail panels pictures were taken at 1000x magnification, t>efore 
reproduction. 

Other objects, advantages and features of the present 
invention will tiecome more apparent upon reading of the foHowing non-restrictive 
5 description of preferred emtjodiments with reference to the accompanying 
drawing which is exemplary and should not be interpreted as limiting the scope 
of the present invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENT 

The data herein presented strongly suggest the fact that (1) 
10 frameshifts occur within CAG repeats and are responsible for the production of 
alanine-containing proteins, and (2) these proteins accumulate as toxic 
aggregates. Detection of the hypothetical peptide inlymphoblastoid and neuronal 
cells of MJD patients and not in controls, and the production of green 
fluorescence preferentially In cells transfected with MJD-1 bearing large CAG 
15 tracts with an out of frame EGFP. supports the occurrence of rare frameshifts 
during transcription and/or translation of long CAG tracts resulting from the use 
of the alternative GCA/Ala reading frame. The relatively small proportion of 
frameshifted product may explain the absence of loss-of-function of the protein 
and the relatively late-onset of these diseases. This model of slow accumulation, 
20 due to relatively rare frameshift events, better explains the late-onset nature of 
these diseases given that, in light of the high expression of these proteins in 
affected brain regions, a glutamine toxicity model would be expected to result in 
much eariier cell death and disease onset^''^'. 

Slippage into the third possible frame, AGC/Ser. may also be 
25 occurring. However, the absence of diseases associated with tracts of polyserine 
and the physical nature of alanine polymers, resulted in an exclusive focus on the 
GCA/Ala frame. While the relative frequencies of different frameshifts remain 
unknown, the fact that most translational frameshift errors cause a +2 shift in the 
frame^ suggests that GCA/Ala may be the more frequent of the mutant species. 
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The present invention also assessed whether polyAla tracts 
are toxic and may be the initiating event in the formation of the INI seen in 
expanded CAG tract diseases. The evidence presented herein shows that in 
cells transfected with the CAG/SIn constructs, frameshifling into the alanine frame 
5 is progressive and frameshifled products are slowly accumulating in the nucleus 
as INI (see Rg. 5). Given the expected low frequency of frameshifls, the finding 
of frameshifted protein in INI as early as 12 hours aftertransfection argues that 
potyAla accumulation is a very eariy event in the formation of these structures. In 
fact, in cells transfected with as few as 14CAGs. frameshifts are occurring and 

10 polyalanine-containing protein is accumulating in the nucleus as INI. This finding 
is consistent with a recent report by Perez et al. showing that transfection of 
normal sized CAG repeats leads to INI^. In contrast with the slow, progressive 
accumulation detected for the CAGA3ln constructs, transfection with the GCA/Ala 
constructs results in early and rapid accumulation of alanine-containing product. 

15 In this case the cells seem unable to cope with the presence of the toxic product 
resulting in an earlier and much more severe phenotype. The pattern of 
expression of polyAia products, consisting of one major juxtanudear aggregate 
per cell and several smaller inclusions, is similar to that found inaggresomes, 
structures that form when the capacity of theproteasome degradation pathway 

20 is exceeded^'. This seems to indicate that the direct expression ofpolyalanlne 
in cells is likely to be extremely toxic. The presence of more "classical" INI in cells 
expressing CAG/GIn constructs is thought to result from rareframeshift events 
that produce alanine-containing protein, which slowly accumulates in the nucleus 
as aggregates. This model therefore better depicts what may be happening in 

25 diseased tissue cells. 

The proposed model of toxic aggregates resulting from 
frameshifts into the alanine frame is consistent with previous experiment 
performed with MJD, as well as other exp-CAG diseases. For example, in 
Huntington disease (HD) and MJD. INI and protein accumulation are most clearly 



wo 01/18544 



PCT/CA0OA)1052 



30 



detected using antibodies against epitopes to the N-tenninus side of the CAG 
repeat*-*, iiltely because frameshifting will truncate the protein shortly after the 
repeat. This model could also explain the correlation observed in all GAG triplet 
diseases between the size of the repeat and the severity of the symptoms^. It is 
5 suggested that if the frameshtft errors occur randomly, the longer the repeat the 
more frequently such errors would arise. This is supported by the observations 
that (1) no accumulation was yet detected in the lymphoblasts of a MJD patient 
with the shortest {CAGki mutation; (2) fluorescence increased with the size of the 
CAG repeat in the transfection experiments; and (3) at any time-point, more than 
10 twice as many cells have inclusions containing frameshifted species when 
transfected with a (CAG)82 construct than with a (CAG),5 construct. 

Another puzzling finding previously reported by other groups, 
and which can be explained by the model of frameshifts into the alternate alanine 
frame and the production of prematurely truncated proteins herein proposed, is 
1 5 the presence of short cleavage products in inclusions in HD and spinal and bulbar 
muscular atrophy (SBMA)^^. These short products seem to be too small to be 
the result of caspase-3 cleavage, but are of a size consistent with premature 
termination of transcription or translation due to the generation of an early STOP 
codon in the alanine shifted frame. 
20 The model presented herein might also explain the 

observation that while both CAA and CAG codons encode glutamine, only 
uninterrupted CAG tracts cause disease. Inspinocerebellar ataxia type 1 (SCA1), 
for polyglutamine tracts of similar length the presence of a CAAcodon intenupting 
a CAG tract is used to differentiate a normal from a disease causing allele^*-". 
25 These observations, while diffiojit to explain if polyglutamine is toxic, are 
predicted by the proposed poiyAla nuclear toxicity model of the present invention, 
( where interruption of a repeat sequence may lead to more stable transcription or 
translation. 
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Finally, the present invention may predict a similar pathogenic 
mechanism in the other diseases caused by expanded CAG tracts. It has been 
shown here that constructs containing almost exclusively a GCA tract are toxic 
to cells and lead to the fomiation of aggregates. This seems to indicate that the 

5 full MJD protein is not necessary to obtain a disease phenotype and that the 
presence of an expanded CAG repeat within any protein may be sufficient to 
cause disease. This would support the contention that the protein or gene 
causing the disease is indeed irrelevant to the disease process (Hardy et al. 
1998). in addition, it is possible thatpolyAla accumulation may play a role in 

10 aging of certain cell types. For example, ifframeshifts causing polyAla tracts 
occur with all CAG repeat-containing genes even in the "nomial range" as is seen 
in the 14 CAG- and 37 CAG-containing constructs of the present invention, very 
slow accumulation of polyAla polymers may occur, leading to cell death. This 
could explain the observation that fewer large repeats are found in healthy elders 

15 compared to younger controls ^. The discovery of expanded polyalanine 
domains in three diseases"^^' and the fact that CAG repeat frameshift mutant 
proteins can cause disease if they code for polyAla, pinpoints this homopdlymer 
as a potential target for drug design. Polyalanine nuclear toxicity may well be a 
frequent cause of premature cell death in different tissues. 

20 The present invention is illustrated in further detail by the 

following non-limiting examples. 

EXAMPLE 1 

Frameshifts occur in CAG tracts resulting in polyaianine- 
containing proteins 

25 Two immunopurified polyclonal antisera were raised against 

a synthetic peptide corresponding to the 12 last amino acids predicted to result 
from polyAla tract producing frameshifts within the CAG repeat of the MJD- 7 gene 
(FS1 and FS2)(Fig. la). This new amino acid sequence has no homology to any 
known protein. Both antisera detected high molecular weight aggregates in the 
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Stacking gels in Western blots of total lymphoblast protein from three MJD cases 
(Fig. lb). In the two controls and in the MJD patient with the shortest (CAG)67 
repeat no aggregates were observed (Fig. lb). Both FS antibodies also 
recognize, in all samples, an 83kD protein of unknown origin. In order to test if the 
5 signal detected in the stacking gels was the putative MJD-Ala protein, the same 
blots were probed with antisera raised against MJD protein epitopes (anti-ataxin- 
3; Fig. 1c)" and against polyGIn domains (1C2; Fig. Id)" These results are 
compatible with the presence of the predicted hybrid protein In the stacking gel. 
An anti-ubiquitin polyclonal antibody (data not shown) also detected the 
10 accumulations, which is consistent with previous reports' *• " ^. These 
accumulations are compatible with those reported by Paulson ef a/, who studied 
cells expressing mutant CAG tract expanded MJD-I using an antibody raised 
against the expressed ataxin-3 fusion proteinsV 

To test if the MJD-Ala protein accumulates in nuclei, 
1 5 immunocytochemistry on lymphoblasts from four controls and three MJD patients 
was performed. As Fig. 2 depicts, FS1 positive INI are observed in MJD cell lines 
and not in controls (Fig. 2a. b). Similar to the Western blot results, the anti- 
ubiquitin antibody (Fig. 2c. d) and the anti-ataxin-3 antibody (data not shown) also 
detect INI in MJD cell lines. 
20 In an attempt to show the presence of the frameshifted 

species in affected MJD brain regions, immunohistochemical FS1 and anti- 
ubiquitin staining in diseased and control pons, a region known to be affected in 
MJD was performed (Fig. 3). Both antibodies stain intranuclear structures in 
neurons of this MJD brain area (Fig. 3a, c), whereas pontine neurons in control 
25 brain have no INI (Fig. 3b, d). In immunofluorescence studies, it was shown that 
the frameshifted product colocalizes with ubiquitin in INI of pontine neurons (Fig. 
3e, f. g, h, i and j). suggesting that the intranuclear structures detected by both 
antibodies are the same. 
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EXAMPLE 2 

Frameshifts into alanine frame occur preferentially 
with expanded CAG tracts 

To test if frameshifts producing GCA/poIyAia occur 
5 preferentially within expanded CAG tracts, an in vitro system to examine the effect 
of CAG repeat length on the frequency of frameshifts was designed. COS-7 cells 
were transfected with constructs bearing the full length MJD-1 sequence with 
either a (CAG),4. (CAGy^ or (CAG)b2 repeat fused out of frame to VheEGFP gene 
and driven by the CMV promoter (Fig. 4 and Fig. 5a). Whileframeshtfts would 

10 occur with all CAG tracts, it was hypothesized that they would occur more 
frequently within larger repeats. The pMJDI. pMJD2 and pMJDS constructs 
would yield EGFP-containing fusion proteins only if a frameshift occurs and 
produces the GCA/polyAla reading frame and the MJD-Ala protein. At 72 hours, 
cells transfected with pM JD2 and, especially pM JD3, showed green fluorescence 

15 (Fig. 5c. d, e). Green fluorescence was observed within 24 hours in the positive 
control cells transfected with the pEGFP-NI vector alone or pMJD4 construct, 
where the EGFP coding sequence was in the glutaminie frame in MJD-1 (Fig. 5f, 
g). At 96 hours, and at higher magnification, frequent EGFPpositive perinuclear 
inclusions were observed in cells transfected with the pMJD3 construct (82 

20 CAGs) but not in the construct with 1 4 CAGs. and rarely in cellstransfected with 
the 37 CAG construct (Fig. 5h). These perinuclear inclusions are similar to those 
found in cell culture models of MJD, as well as other CAG tract disorders, such 
as Huntington disease^- *. 

Western blots of protein extracted from the transfected cells 

25 were probed to confirm this interpretation (Fig. 5i, j). \A/hile 1C2 detects only the 
expanded MJD gene products bearing either 37 or 82 polyglutamine repeats in 
cells transfected with pMJD2 and pMJD3 respectively (Fig. 5i), anti-ataxin-3 
detects all three different size polyglutamine-tract containing gene products (Rg. 
5j), as expected. With both antibodies, protein accumulation in the wells for 
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pMJD3 and PMJD4 was detected, in order to further determine the nature of the 
accumulated protein seen for pMJD3. the pMJDI and pMJD3 constructs were 
modified by adding epitopes for FS1 and HA in the alanine frame (pMJOS and 
pMJD6, Fig. 4). Western blots of cells transfected with these constructs were 
5 probed with anti-HA. No signal for the 14 CAG-bearing pMJ05 constmct was 
detected, but a band corresponding to aggregated protein was detected at the top 
of the gel for pMJD6, shoMring that with 82 repeats.frameshifts are occurring. The 
absence of a band coo'esponding to the predicted size of the ataxin-3/HA protein 
(Fig. 5K) indicates that all frameshifted protein is accumulating as insoluble 

10 aggregates. These experiments further demonstrate thatframeshifts do occur 
and that their frequency increases with the size of the CAG repeat. 

It shall be recognized by the skilled artisan that the in vitro 
system described above can be modified at will and still enable a determination 
of the frameshifting frequencies. Non-limiting examples of such modifications 

15 include the use of longer or shorter CAG-tracts. different reporter gene (i.e. 
luciferase) and different epitopes. In addition, such systems could be used to 
screen for drugs or compounds which modulate frameshifting and/or affect the 
level of INI formation and/or poiyalanine-protein aggregation and/or cell toxicity. 

it should also be recognized that the/n vivo methods shown 

20 above (and others) could also be used to screen drugs orcompounds which can 
modulate the level of polyalanine formation, INI formation and the like. In 
addition, these 7n vivo methods (and others) could be used to validate the effect 
of compounds or drugs identified in an in vitro assay. 
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EXAMPLE 3 
Poiyalanine-containing proteins are present in 
aggregates and are toxic to transfected COS-7 cells 
5 in a time-dependant manner 

It has been shown thatframeshifting into the alternate alanine 
frame occurs both in vitro and in vivo. To detemiine if these products are toxic 
or are simply hanmtess byproducts with no real consequences to the cell, a new 
set of MJD-1 constructs where the reading frame immediately before the CAG 
10 repeat was mutated to code for a polyalanine stretch was designed. These 
constructs, pMJD9 and pMJDIO, contained stretches of 14 GCA repeats and 82 
OCA repeats respectively, and were fused in frame to the EGFP gene (Fig. 4). A 
HA tag was also added at the COOH-terminus in frame with the GCA stretch. 
COS-7 cells were transfected with these constructs, and with pMJD7 and pMJD8, 
15 which contained 14 CAG repeats and 82 CAG repeats respectively, as well as 
EGFP fused in frame with the CAG tracts. In addition, a HAepitope was also 
added at the COOH-terminus in frame with the GCA tracts, and so only 
detectable if frameshifts occurred (Fig. 4). 

In a time-course experiment using anti-HA antibody as a 
20 probe to detect only protein frameshifted to the alanine frame, cells were collected 
and immunostained cells at 8, 12, 16, 20, 24, 48 and 72 hours. Cellstransfected 
with the CAG/GIn constructs pMJD7 and pMJD8 showed faint background 
staining at 8 hours (Fig. 6a, c). Positive signal was detected for these two 
constmcts at 12 hours (not shown) in the form of intranuclear inclusions (typically 
25 one or two per nucleus). At 20 hours, nuclei of cellstransfected with pMJD7 and 
pMJD8 contained inclusions, but were morphologically normal (Fig. 6f, h). At 24 
hours, cells transfected with the shorter construct remained morphologically 
normal (Fig. 6k), but cellstransfected with the construct bearing 82 CAG repeats 
started showing some perinuclear and cytoplasmic inclusions in addition to the 
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intranuclear aggregates (Fig. 6nfi). At this time-point about 85% oftransfected 
cells have inclusions when transfected with pMJDS. whereas only 40% of 
transfected ceils show inclusions when transifected with pMJD7. suggesting that 
frameshrfts occur with both constructs, but more frequently for the longer repeat. 
5 Nuclei of cells stained at 48 hours showed some membrane disintegration with 
both constructs, but cells containing the longer CAG repeat also had perinuclear 
and cytoplasmic inclusions (Fig. 6p. r). indicating a more severe phenotype. It is 
useful to reiterate that probing cells transfected with the CAGGIn constructs with 
anti-HA will only detect frameshifted protein. These species are detected earty 
10 in the transfection and only as INI. suggesting that, despite the probable rarity of 
frameshifts, they are producing proteins that accumulate as highly insoluble INI. 

inclusions for l>oth GCA/Ala constructs (14A and 82A) as 
early as 8 hours after transfection were detected (Fig. 6b, d). Typically, cells have 
one major perinuclear or cytoplasmic inclusion and abnormal nuclear 
15 morphology. The cellular phenotype progresses rapidly with time in cells 
transfected with either pMJD9 (Fig. 6g. I. and q) or pMJDIO (Fig. 6i, n. s). and 
was extremely severe when compared with the CAG/GIn counterparts of the 
same repeat size (for example: compare panels m and n in Fig. 6). Cells 
transfected with the GCA/Ala constructs showed abnormal nuclear structure and 
20 aggregate fomiation mainly in the cytoplasm, usually with one majorjuxtanuclear 
inclusion and what appears to be cytoskeletal reorganization. Similar results 
were obtained in cells transfected in parallel with the same constructs but probed 
with FS1 antiserum (results not shown). At all time-points cellstransfected with 
the pEGFP-NI vector alone showed only background staining and were devoid 
25 of inclusions (Fig. 6e, j, o, t). 

Western blots of protein extracted from the transfected cells 
were performed to investigate the nature of the inclusions found. Probing the 
blots with anti-HA detected a signal for both GCA/Ala constructs (pMJD9 and 
pMJDIO), where the HA tag is in the main reading frame (figure 7a). No signal 
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was detected wtth equivalent exposure for the CAG/GIn constructs, but 
overexposure of the blots revealed the presence of smears and protein in the 
wells for these constructs as well. In addition, while discrete bands are resolved 
with pMJDS. signal is only seen in the stacking gel with pMJDIO, suggesting that 
the larger polyAla-containing proteins are very insoluble and do not migrate into 
the gel at all. Probing the same blots with 1C2 confirms the presence ofpolyGIn- 
contatning proteins in the cells transfected with pMJD8, and the absence of such 
proteins in cells transfected with frameshifted constructs (figure 7b). The 
presence of signal in the stacking gel for pMJD8 may result from the presence of 
enough Gin residues to allow detection by the 1C2 antibody. It could also be due 
to the recruitment of the intact non-frameshifted protein into the insoluble 
aggregates, recruitment of hybrid polyAla/polyGIn protein, or polyGIn protein 
accumulation independent of polyAla polymers. 

EXAMPLE 4 

15 PolyGCA/polyAla stretches alone are toxic and sufficient 

for formation of aggregates 

in order to determine whether a construct with only a GCA 
tract encoding an alanine peptide, outside the context of the ataxin-3 protein, 
would be sufficient to produce a cellular phenotype, a truncated (GC/^42 construct 

20 with FS1 and HA epitopes in frame with GCA was transfected into COS-7 cells 
(pMJD1 1 . Fig. 4). In this construct, the MJD-1 sequence was truncated so that 
the resulting protein will only have 25 amino acid residues left upstream from the 
repeat, and only the FS1 and HAejMtopes after the GCA tract. At 24 hours, cells 
showed perinuclear and cytoplasmic aggregates and an abnormal riuclear 

25 morphology (Fig. 8a), a phenotype that is very similar to the one obtained by 
transfection of the full-length GCA/Ala constructs described above. Mock- 
transfected cells showed background staining and absence of inclusions (Fig. 
8b). These findings indicate that the presence of a frameshifted CAG repeat 
product is sufficient for toxicity, independent of the protein context, and suggest 
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a similar pathogenic mechanism may be operating in the other diseases caused 
by expanded CAG tracts. This finding is thus of relevance to expand the 
teachings of the present invention to all diseases or conditions caused by 
expanded CAG tracts and to diseases and conditions caused by the accumulation 
5 of polyaianine-containing proteins. 

EXAMPLE 5 
Antisera production 

A 1 2-mer peptide corresponding to the new predicted CCXDH- 
1 0 terminus of the ataxin-3 protein after frameshHt occurs (AAGPIRTEFTSM) was 
used to raise antisera from two rabbits, FS1 and FS2- Injections were performed 
using 0.5 mg of peptide conjugated to KLH and sera were collected using 
standard protocols*^. 

EXAMPLES 

15 Western Blots 

For protein extraction. MJD and control lymphoblastoid cells 
were lysed in buffer containing NP-40. Equal amounts of protein were 
electrophoresed in 8% SDS-polyacrylamide gels and transblotted to nitrocellulose 
membranes (as commonly known). Immunodetection was performed using FS1 
20 (1:300), FS2 (1:300), anti-ataxin-3 (1:1000). 1C2 (1:2000), anti-ubiquitin (1:400; 
DAKO) and FS1 and FS2 pre-immune semm (1:300). Results for FS1 and FS2 
were always consistent; all experiments were repeated three times on different 
blots. HRP conjugated secondary antibodies were used at a 1:10.000 dilution. 
COS-7 cells transfected with various constructs were collected, washed and lysed 
25 in Sample Loading Buffer. 100 pg of each sample was used to run on a 10% 
SDS-PAGE and transblotted onto nitrocellulose membranes. Immunodetection 
was performed using antisera at following dilutions: 1C2 (1 :5000), anti-ataxin-3 
(1:2000) and anti-HA (1:1000). Results were visualized by chemiluminescence 
(RENAISSANCE). 
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EXAMPLE 7 
Immunocytochemistry 

MJD and control lymphoblastoid cells were harvested and a 
total of 50x10^ cells from each cell line were plated onto poly-D-lysine coated 

5 slides and fixed with acetone/methanol (1:1). Immunodetection was performed 
using FS1 (1:300). anti-ubiquitin (1:300). anti-ataxin-3 (1:500) and FS1 pre- 
immune serum (1:300). Biotinylated secondary antibodies were used at a 1:500 
dilution and an amplification step was perfomned using the ABC kit (VECTOR). 
Reaction product was visualized using the VIP kit (VECTOR). Immunocyto- 

1 0 chemistry on COS-7 cells was performed on cover slips. At each time-point cells 
were fixed with 4% paraformaldehyde and immunodetection was performed using 
anti-HA probe (1:500) and FS1 (1:300). Secondary antibodies and subsequent 
amplification and detection procedures were carried out as described above. 

EXAMPLE 8 

15 Immunohistochemistry 

For immunohistochemistry of brain sections, 5 pm sections of 
paraffin-embedded tissue from the pons of an MJD patient and a control subject 
were used. Sections were deparaffinized, permeabilized and immunostained with 
FS1 (1 :50) and anti-ubiquitin (DAKO) (1 :300). Biotinylated secondary antibodies 

20 were used at a 1:500 dilution and an amplification step was performed using the 
ABC kit (VECTOR). Reaction product was visualized using the VIP kit 
(VECTOR). Forcoimmunofluofescence of FS1 and anti-ubiquitin antitxsdies in 
brain sections, the same procedure for preparation of samples was followed. 
Immunodetection was performed using FS1 (1:50) and monoclonal anti^ibiquitin 

25 (ZYMED) (1:300). A mixture of Cy3-conjugated anti-mouse antibody (1:1 00) and 
fluorescein-conjugated anti-rabbit antibody (1 :50) was used as secondary probe. 
Sections were mounted in SlowFade (Molecular Probes). 
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EXAMPLE 9 

Plasmid construction, transfectlon and cell culture 

DNA amplification was perfonned using Pfu DNA polymerase 
(STRATAGENE). Primer MJD-5': TTTTAAGCTTAGACAAA-TAAACATGGAG 
5 (SEQ ID NO:1) was used in conjunction with MJD-3': 
CCGGTGGATCCCTCATCCTGATAGGTCCCGCTGCTG (SEQ ID NO:2) for 
pMJD1, pMJD2 and pMJD3, or MJD-3'c: CCGGTGGATCCCTCA- 
TTGATAGGTCeCGCTGCTG (SEQ ID NO:3) for pMJD4 (STRATAGENE). 
Primer MJD-5'(Hm): TTTAAGCTTCCCACCATGGAGT-CATCTTCCA (SEQ ID 

10 NO-4) was used in conjunction with MJD-3'(BI): 
CCGGTGGATCCCTCAGGGCGTAGTCGGGGACGTCGTAGGGGTACATGGAT 
GTGAACTCTGTCCTGATAGGTCCCGCTG (SEQ ID NO:5). for pMJD5 and 
pMJD6, or MJD-3'c(BI): CGGGTGGATCCCAGGGCGTAGTCGGGGACGTCG- 
TAGGGGTACATGGATGTGAACTCTGTCCTGATAGGTCCCGCTG (SEQ ID 

15 NO;6) for pMJD7 and pMJD8, or MJD-Ala: 
CGGAAGAGACGAGAAGCCTACTCCGGAAAAACAGCAGCAAAA-GCAGC 
(SEQ ID NO:7) for pMJD9. pMJDIO pMJDH. Amplified products were digested 
with BamHI and Hindlll and doned into plasmid pEGFP-NI (CLONTECH). except 
for pMJDi 1, for which the amplified product was cloned into a modified version 

20 of the pEGFP-NI plasmid lacking the EGFP gene. All constructs were confirmed 
by sequencing (as commonly known). COS-7 cells were seeded in Dulbecco's 
modified Eagle's medium (DMEM) containing 10% fetal calf serum the day before 
transfection at 2 x 10^ per well in 6-well plates containing sterile coverslips. COS- 
7 cells were transfected with plasmid DNA (2.0 pg) using iipofectamine reagent 

25 (GIBCO BRL) according to the manufacturer's instructions. For the experiment 
depicted in figure 4. after 72-96 hours, the cells were fixed with PBS/4% 
paraformaldehyde and observed under a fluorescent microscope with FITC filter 
in four independent experiments. 



CONCLUSION 
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The present invention thus shows that transcriptional or 
translational frameshifts occurring within expanded CAG tracts result in the 
production and accumulation of polyalanine-containing mutant proteins. These 
alanine polymers might deposit in cells forming INI and lead to nuclear toxicity. 
5 Support for this disease model is provided using lymphoblast cells from MJD 
patients, as well as in pontine neurons of MJD brain and in in vitro cell culture 
models of the disease. Evidence that alanine polymers alone are toxic to ceils is 
also provided and strongly suggests that a similar pathogenic mechanism 
underlies the other CAG repeat disorders. How these accumulations lead to cell 
10 , death still needs to be elucidated. 

Although the present invention has been described 
hereinabove by way of preferred embodiments thereof it can be modified, without 
departing from the spirit and nature of the subject invention as defined in the 
appended claims. 
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WHAT IS CLAIMED IS: 

1 . A method for the diagnosis of a disease associated with 
protein accumulation in intranuclear inclusions in a cell of a patient, which 

5 comprises: 

a) obtaining a sample from said patient; and 

b) determining a presence of said protein accumulation in 
said intranuclear inclusions 

wherein said protein accumulation in said intranuclear inclusions is indicative of 
10 a disease associated therewith. 

2. The method of claim 1, wherein said protein is a 
poiyaianine-containing protein. 

3. The method of claim 1 or 2. wherein said disease is a 
neurological disease. 

15 4. The method of claim 1 , 2 or 3. wherein said detemnining is 

carried out with one of a ligand and/or a nucleic acid sequence. 

5. A method for the screening of agents which can modulate 
at least one of: (a) polyamino acid stretch-containing protein expression: (b) 
accumulation of polyamino acid stretch-containing proteins in intranuclear 
20 inclusions: and (c) toxicity of polyamino acid stretch-containing proteins to cells, 
which comprises: 

a) incubating a cell which expresses a polyamino acid stretch-containing 
protein, associated with a disease or condition in an animal, with a 
compound: and 
25 b) assessing one of a) to c): 

whereby a modulator is selected when said agent significantly modulates one of 
said expression, accumulation and toxicity, as compared to a control agent. 
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6. The method of claim 5, wherein said polyamino acid 
stretch-containing protein is a potyaianine stretch-containing protein. 

7. The method of claim 5 or 6, wherein said polyamino acid 
5 stretch-containing protein is expressed by an expression vector which comprises 

a repeat domain. 

8. The method of claim 7. wherein said polyamino acid 
stretch-containing protein is a polyalanine stretch-containing protein. 

10 

9. The method of claim 8. wherein said polyalanine stretch is 
encoded by a CAG repeat. 

10. The method of claim 9, wherein said CAG repeat is an 
1 5 uninterrupted CAG tract. 

1 1 . The method of claim 5, wherein said cell is selected from 
a lymphoblast cell from a Machado-Joseph disease (MJD) patient, a pontine 
neuron of MJD brain and an in vitro cell culture model of a neurological disease 

20 associated with said polyamino acid stretch-containing protein. 

12. A method to trigger toxicity in a cell comprising an 
increased expression of an alanine polymer stretch in a protein. 

25 13. The method of claim 8, wherein said polyalanine stretch is 

encoded by a GCG repeat. 

14. The method of claim 5, wherein said cell is isolated from 
an oculopharyngeal muscular dystrophy (OPMD) patient. 
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15. The method of cfaim 13 or 14, wherein said GCG repeat 
is present in the PABP2 gene. 
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<12 0> DIAGNOSIS, PROGNOSIS AND TREATMENT OF TRINUCLEOTIDE 
REPEAT-ASSOCIATED DISEASES AND INTRANUCLEAR 
INCLUSIONS-ASSOCIATED DISEASES 

<130> 1619.0110000 

<140> 10/070,664 
<141> 2000-09-08 

<150> PCT/CAOO/01052 
<151> 2000-09-08 

<150> 60/152,941 
<151> 1999-09-08 

<160> 8 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 1 

ttttaagctt agacaaataa acatggag 



<210> 2 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 2 

ccggtggatc cctcatcctg ataggtcccg ctgctg 



<210> 3 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 
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<400> 3 

ccggtggatc cctcattgat aggtcccgct gctg 



<210> 4 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

oligonucleotide 

<400> 4 

tttaagcttc cca'ccatgga gtcatcttcc a 31 



<210> 5 

<211> 78 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 5 

ccggtggatc cctcagggcg tagtcgggga cgtcgtaggg gtacatggat gtgaactctg 60 
tcctgatagg tcccgctg 78 



<210> 6 
<211> 76 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 6 

ccggtggatc ccagggcgta gtcggggacg tcgtaggggt acatggatgt gaactctgtc 60 
ctgataggtc ccgctg 76 



<210> 7 

<211> 47 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 7 

cggaagagac gagaagccta ctccggaaaa acagcagcaa aagcagc 47 



<210> 8 

<211> 12 

<212> PRT 

<213> Homo sapiens 



-3- 



<400> 8 

Ala Ala Gly Pro lie Arg Thr Glu Phe Thr Ser Met 
15 10 
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